Mon. Not. R. Astron. Soc. OOP. [THT2lf 0000*) Printed 2 April 2012 (MN ICT^ style file v2.2) 

The luminosities of the brightest cluster galaxies and 
brightest satellites in SDSS groups 



(N 

o 



o 

en 

o 

in: 



(N 

>: 

(N 

<^ 

o 



Aseem Paranjape^* & Ravi K. Sheth^'^ 

^ The Abdus Salam International Center for Theoretical Physics, Strada Costiera 11, 34151 Trieste, Italy 
^ Center for Particle Cosmology, University of Pennsylvania, 209 S. 33rd St., Philadelphia, PA 19104, USA 



2 April 2012 



^; 



ABSTRACT 

We show that the distribution of himinosities of Brightest Cluster Galaxies in an 
SDSS-based group catalog suggests that BCG luminosities are just the statistical 
extremes of the group galaxy luminosity function. This latter happens to be very 
well approximated by the all-galaxy luminosity function (restricted to Mr < —19.9), 
provided one uses a parametrization of this function that is accurate at the bright end. 
A similar analysis of the luminosity distribution of the Brightest Satellite Galaxies 
suggests that they are best thought of as being the second brightest pick from the 
same luminosity distribution of which BCGs are the brightest. I.e., BSGs are not 
the brightest of some universal satellite luminosity function, in contrast to what Halo 
Model analyses of the luminosity dependence of clustering suggest. However, we then 
use mark correlations to provide a novel test of these order statistics, showing that 
the hypothesis of a universal luminosity function (i.e. no halo mass dependence) from 
which the BCGs and BSGs are drawn is incompatible with the data, despite the fact 
that there was no hint of this in the BCG and BSG luminosity distributions themselves. 
We also discuss why, since extreme value statistics are explicitly a function of the 
number of draws, the consistency of BCG luminosities with extreme value statistics is 
most clearly seen if one is careful to perform the test at fixed group richness N. Tests 
at, e.g., fixed total group luminosity Ltot, will generally be biased and may lead to 
erroneous conclusions. 
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1 INTRODUCTION 

The study of whether or not the brightest cluster galaxy (the 
BCG) is special, rather than simply being the brightest by 
chance, has a long history (Scott 1957; Schechter & Peebles 
1976; Tremaine & Richstone 1977; Bhavsar & Barrow 1985; 
Loh & Strauss 2006; Lin, Ostriker & Miller 2010, Dobos & 
Csabai 2011). Since the BCG is usually at or close to the 
cluster center, it is plausible that its formation history was 
dominated by different physical processes compared to the 
other galaxies in the cluster, which we will call satellites. For 
this reason, models of BCG formation routinely assume that 
BCGs are special (e.g. Milosavljevic, et al. 2006; De Lucia 
& Blaizot 2007). And there is growing evidence from scal- 
ing relations that BCGs are indeed special: they have larger 
sizes, smaller velocity dispersions, smaller color gradients, 
and are less spherical than expected for their luminosities 
(e.g. Bernardi et al. 2011). But whether or not this has left 
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a distinct signature in the distribution of BCG luminosities 
is an open question. 

There are at least two reasons why the answer is in- 
teresting. First, Halo Model (see Cooray & Sheth 2002 for 
a review) interpretations of the luminosity dependence of 
galaxy clustering explicitly distinguish between the central 
and the other (satellite) galaxies in a halo. If BCGs are just 
the brightest of a universal galaxy luminosity function, then 
including this constraint significantly simplifies such analy- 
ses. 

The second is that the distribution of BCG luminosi- 
ties is considerably narrower than that of all galaxies, so the 
large number of clusters that will soon be available may per- 
mit BCGs to provide a useful consistency check of standard 
candle constraints on the luminosity-distance relation (Hub- 
ble 1936; Scott 1957; Sandage, Kristian & Westphal 1976). 
Alternatively, if the di^{z) relation is known and BCGs are 
just statistical extremes, then the BCG luminosity function 
can be used to provide a rather accurate determination of 
the evolution of the galaxy luminosity function. Since BCGs 
are bright, they are amongst the easiest targets for spec- 
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troscopy, so this measure of galaxy evolution conies at a 
small fraction of the cost of a full galaxy survey. On the 
other hand, if they are special in other ways, one must also 
understand how the physical processes which affected their 
formation evolve. 

In what follows we will use the Halo Model analyses to 
motivate why one might have thought that the BCG lumi- 
nosity function might simply be given by the extreme value 
statistics of the all galaxy luminosity function, even though 
centrals are often explicitly treated as being special. In this 
case, it is natural to ask if the luminosity function of the 
brightest satellite galaxies - the BSG luminosity function - is 
simply given by the associated order statistics of the second 
brightest (rather than the brightest) object. Then we will 
show that Halo Model analyses also suggest that the BSG 
luminosity function should be given by the extreme value 
statistics of a satellite galaxy luminosity function, rather 
than the order statistics of the all galaxy luminosity func- 
tion. We will show that both statements cannot be right, so 
we turn to the data to decide the issue. 

Section [5] summarizes the main results from Order 
Statistics and the Halo Model approach which are relevant 
to this study. Section |3] uses a group catalog of SDSS galax- 
ies from Berlind et al. (2006) (henceforth B06) to test these 
models regarding BCGs and BSGs. These tests include the 
luminosity functions of BCGs and BSGs, the ratio of the 
luminosities of the first and second brightest galaxies in 
a cluster, and the luminosity function of satellite galaxies 
(i.e., not just the BSGs). Section |4] shows that luminosity 
weighted clustering provides a novel test of Order Statistics. 
A final section summarizes. Appendix [X] provides a deriva- 
tion of the satellite distribution if the BCGs satisfy extreme 
value statisics. Appendix[B] demonstrates that tests of order 
statistics are better made holding the number of galaxies 
fixed, rather than other parameters such as group luminos- 
ity, etc. And Appendix [C] describes the details of the Halo 
Model that are relevant to our study. 



2 PRELIMINARIES 

2.1 Extreme value and order statistics 

The probability that the largest of A'' independent draws 
from a distribution p{L) is less than L is given by 



gii< L\N) = P{< LY 



(1) 



where P(< L) = J^ dL'p(I/'), and we will define the value 
of I/min later. The associated differential distribution is 

g^iL\N)^Np{L)Pi<Lf-\ (2) 

Similarly, the distribution of the second largest of the N 
draws satisfies 

<?2« L\N) = P(< L)"" + r] P{> L) P{< Lf-' , (3) 

so that 

g2iL\N) = N(N ~ l)p(L) P(> L) P(< L)^-^ . (4) 
More generally, the nth largest of N draws obeys 



g4L\N) 



In what follows we will be interested in whether or not 
the distribution of BCG and BSG luminosities are given by 
equations © and ^ respectively, with p{L) given by the 
luminosity function of all galaxies, although clearly, similar 
tests with the other values of n could also be devised. 

If we define the satellite luminosity function as the dis- 
tribution which is obtained by subtracting the distribution 
of centrals (assumed to he gi{> L\N)) from that of all galax- 
ies, then one might ask if the luminosity function of BSGs is 
given by inserting this distribution of satellite luminosities 
for p in the expression above for gi . In AppendixlX] we show 
that this is not the same as g2. We will argue shortly that 
the Halo Model suggests that both of these models might be 
acceptable for satellites. 

When discussing the group catalog of BOG, we will use 
the luminosity distribution pgai (I/) derived from equation (9) 
of Bernardi et al. (2010), restricted and normalized to L > 
Lniin, for some Lmin which we will specify shortly. Explicitly, 



iPgal(i) = 



P {L/L, 



(6) 



The parameters in Table Bl of Bernardi et al. assumed that 
absolute magnitudes brighten as 1.32, and were quoted at 
z = 0. Since the B06 sample is based on absolute magnitudes 
that were fc-corrected and evolution corrected to 2 = 0.1, we 
adopt an increased value L* (by a factor \{f'^>^^-^>^^-^^ but 
leave the other parameters a and /? unchanged. I.e., we use, 

L, = 0.104 X 10^°iQ/i~^ ; OL = 1.12 ; /3 = 0.533 . (7) 

This fit differs slightly from the usual Schechter function of 
Blanton et al. (2003) (their Table 2): 

L. = 1.202 X lO^°L0/i"^ ; a = -1.05 ; /3 = 1 . (8) 

Before moving on, note that the median value of g\ 
occurs at luminosity I/1/2, where 



(1/2) 



/2, 
1/JV 



P{<L 



l/2j 



(9) 



If p{L) were simply an exponential distribution, then P{> 
L) = exp[-(L - Ln,in)/L,], so -ln(2)/A = ln[l - P(> 
L1/2)] ~ -^(> ^1/2), and L1/2/L, = Lm^r^jL^ - lnln(2) -f 
ln(A'^) would grow logarithmically with N . It is the fact 
that lnLi/2 is an even weaker function of A which makes 
BCGs standard candles (if they are indeed just statistical 
extremes) . 

Similarly, if we define 1/0.84 by 0.84 — P{< io.84)^, 
then the difference between it and L1/2 is a measure of the 
width of gi . For an exponential distribution this is given by 
AL = Lo.84 - Li/2 = L, ln[(l - (0.5)i/^)/(l - (0.84)i/^)] « 
1.38 L*. li Lmin/L, -C 1, this means that AL/L1/2 decreases 
as 1.38/ ln(A), meaning BCGs in richer clusters are better 
standard candles. 

If p is given by equation (|6]) using equation (O and 
Lmin = 0.747 X 10^° LQh-^, then L1/2 = (3.24,3.71) x 
lOi°L0/i-2 and AL = (1.69, 1.78) x W^^Leh-^ for N = 



np{L) P{> L)"" P(< L)' 



(5) 



(10, 15), respectively (see also Figure iBTjl . 

Notice that order statistics such as these are explicitly 
a function of the number of draws N. Therefore, the natural 
way to test if, e.g., BCGs are well- fit by equation ((2)1 with 
p{L) given by the all-galaxy luminosity function of equa- 
tion (0 is to perform tests on groups which have the same 
N (rather than the same mass, total luminosity, etc.). We 
discuss this further in Appendix iBl 
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2.2 The Halo Model approach 

The Halo Model approach assumes that galaxies reside in 
dark matter halos, each of which may host more than one 
galaxy. Measurements of how the number density and clus- 
tering strength depend on galaxy luminosity are used to 
determine how galaxies populate halos (e.g., A'' oc Mhaio?) 
There are two ways in which this is usually done. 

The first is known as the Halo Occupation Distribution 
(HOD) approach (e.g. Zehavi et al. 2005, 2011). Here, one 
determines how the number of galaxies brighter than some 
Lmin must scale with halo mass so as to produce the ob- 
served number density and clustering strength. In detail, the 
fitting assumes that, in halos which host at least one galaxy, 
the distribution of the number of additional galaxies follows 
a Poisson distribution. Since a Poisson distribution has one 
free parameter, the HOD approach aims to characterize how 
this parameter depends on halo mass and Lmin. By repeating 
the analysis for a range of Lmin values, this approach, in ef- 
fect, provides a determination of how the galaxy luminosity 
function depends on halo mass. The additional assumption 
that a halo must contain a central galaxy before it can host 
satellites allows one to interpret the HOD findings in terms 
of centrals and satellites. In what follows we will describe 
a few conclusions which follow from this assumption. The 
ease with which such conclusions can be derived has meant 
that it is now conventional to even phrase the HOD explic- 
itly in terms of a 'central + Poisson-satellites' model, before 
fitting to data. We discuss a simplified HOD model below 
to emphasize our main points; in Appendix C we describe 
the more sophisticated implementation from Zehavi et al. 
(2011), which we will use for comparison with data. 

The HOD analysis assumes that the number of satel- 
lites brighter than L scales as a power-law in halo mass, 
[M/Mi{L)]°'^^\ where Mi is the halo mass scale on which 
each halo hosts, on average, one satellite. Therefore, the dis- 
tribution of satellite luminosities brighter than some Lmin, 
that are in halos of mass M, is simply 



d(HOD) 



(> L|M, Lmin) 



[Af/Mi(L)]"(-^' 



(10) 



[M/Ml(Lmin)]°('^™ ) 

Matching the observed luminosity dependence of clustering 
requires that 

W^h-^^Me "" 23 A/min(L) = 23 [exp(Lio) - 1] , (11) 

where Lio is the luminosity in units of 10^''/i~^Lq, and a ~ 
1 approximately independent of L (it increases by about 
20 percent when Lmin increases by 2 orders of magnitude). 
The fact that a is approximately independent of L implies 
that the distribution of satellite L must be approximately 
independent of halo mass M, and this is in good agreement 
with measurements in the SDSS (Skibba et al. 2007). 

The inverse of the Mmin(L) relation (equation lll|) in- 
dicates how the luminosity of the central galaxy scales with 
halo mass: 



Lcc 



1.1 X lOio/i-^Lg 



In 1 + 



M 



W^h-^MQ 



(12) 



Note that although L^cn is a strong function of mass at small 
M, it grows only logarithmically at high masses. This will 
be important shortly. 

There is another implementation of the Halo Model 



which is known as the Conditional Luminosity Function 
(CLF) approach (e.g. Yang, Mo & van den Bosch 2008). 
In this case, one models how the galaxy luminosity func- 
tion depends on halo mass by explicitly postulating that the 
centrals and satellites have different (halo mass dependent) 
luminosity functions. The HOD and CLF analyses yield con- 
sistent conclusions. E.g., Table 2 of Yang et al. (2008) shows 
that their analysis of how the mean number of satellites 
scales with halo mass also yields a ~ 1, and depends only 
weakly on halo mass. Their A/i(L) relation is reasonably 
well described by our equation dill) . And they find that the 
luminosity function of centrals becomes a weak function of 
halo mass at large masses (see their Figure 6). Although 
they parametrize this using a double power-law, we have 
checked that the HOD-based expression above (our equa- 
tion I12|) provides a reasonable description of their fit (their 
equation 6). 

2.3 Extreme values and the Halo Model 

The weak, approximately logarithmic growth of Lccn at large 
halo mass suggests that BCGs might be drawn from an ex- 
treme value distribution that is determined from the full 
galaxy distribution (equation [2] with |6} . In this case, it is 
natural to ask if the luminosity distribution of BSGs is given 
by equation (J4]| with (|6]). 

On the other hand, if the distribution of satellite lumi- 
nosities has an approximately universal form (i.e. halo mass 
determines the overall number but not the shape of the dis- 
tribution), then one might reasonably expect extreme value 
statistics to provide a good description of BSG luminosities. 
In particular, one would therefore expect that the luminos- 
ity distribution of the second brightest galaxy in a group, 
which is the brightest satellite, should be well-described us- 
ing the extreme values of Psat(L), where Psat(L) is the uni- 
versal satellite luminosity function (rather than that for all 
galaxies): e.g., if the HOD parameter a were indeed inde- 
pendent of L, then the BSG distribution should be given 
by using equation (|10p in equation ([2|. However, as we re- 
marked earlier (see Appendix 1X1 for details), this is, in gen- 
eral, different from the expected distribution of the second 
brightest galaxy in a group, under the assumption that the 
BCGs are just the brightest drawn from the universal galaxy 
luminosity function. 



3 COMPARISON WITH SDSS DATA SETS 

To address these issues, we use the Mr20 catalog from 
Berlind et al. (2006), which is based on the SDSS main 
galaxy catalog. The groups in the Mr20 catalog are derived 
from a volume limited sample containing galaxies with spec- 
troscopic redshifts between 0.015 < z < 0.1, with an r-band 
absolute magnitude (fc-corrected and evolution corrected to 
z = 0.1) brighter than Mo.ir — Blog^gh ~ — 19.!|j, corre- 



^ The evolution correction brightens the magnitude of the p^ 
object according to A/o.ir,j — > A/o.ir,j -I- Q{zj — 0.1), where Q = 
1.62 is taken from Blanton et al. (2003, their Figure 5). Note 
that the values reported in the public catalog are not evolution 
corrected; all our results use the corrected absolute magnitudes. 
Also, we find that the value of the absolute magnitude threshold 
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Figure 1. Luminosity distribution of the Mr20 galaxy catalog 
from Bcrlind et al. (2006) which contains groups with N > 2 
members (histogram). Solid curve shows equation ||6ll) with pa- 
rameters from equation Q. We also display the p- value from a 
KS analysis comparing the data with the solid curve. For compar- 
ison, dashed curve, which predicts a slightly fainter distribution 
than observed, shows the Schechter distribution with parameters 
from Blanton et al. (2003). 



sponding to luminosity Lmin = 0.747 x 10 LqH' . The 
groups are identified by the Friends-of-Friends algorithm, 
corrected for fiber collisions and survey-edge effects (see BOG 
for details), which results in 4107 groups containing N > 2 
members, i.e. A^^at > 1 satellites. As noted by B06, however, 
the smaller groups with A'' < 10 are affected by systematic 
biases in the identification algorithm. We therefore restrict 
our extreme values analysis to the subset of the Mr20 cata- 
log containing groups with A^ > 10 members, of which there 
are 332. The catalog contains luminosity information for all 
member galaxies. 



3.1 Group-galcixy luminosity distribution 

We begin by asking whether the galaxies in the Mr20 catalog 
are consistent with being drawn from a universal luminosity 
distribution. Figure [T] shows the galaxy luminosity distribu- 
tion of the full Mr20 catalog, which has groups containing 
N > 2 members. The solid curve shows equation ^ with pa- 
rameters adapted from Bernardi et al. (2010) (equation [T]). 
It describes the luminosity distribution in the Mr20 sample 
remarkably accurately, especially given that this fit was ob- 
tained from the full galaxy sample - not the subset which 
satisfy the B06 selection criterion. We also display the p- 
value from a Kolmogorov-Smirnov (KS) analysis comparing 
the solid curve with the data. 

For comparison, the dashed curve shows equation ((6]) 



in the public catalog after evolution correction is —19.924, which 
is what we adopt. 



0.1 



30.01 



0.001 




L(10"'l,~ 



Figure 2. Luminosity distribution of subsamples of the Mr20 
galaxy catalog containing groups with A' > 10 and Af > 15 mem- 
bers (histograms). Solid curve in each panel is the same as in 
Figure [T] and shows Pgai(i') (equation |6] using equation [7]l . This 
same function provides an accurate description of each subsam- 
ple. We also display the p-values from a KS analysis comparing 
the data with the solid curve. 



with parameters derived from Blanton et al. (2003) (equa- 
tion [S]) . This form predicts somewhat fainter galaxies than 
are present in the BOG sample. This is not surprising given 
previous work showing that that the Schechter parametriza- 
tion is a poor description of the bright-end of the all-galaxy 
sample, and groups are expected to preferentially host the 
brightest galaxies. (A KS analysis in this case gives a negli- 
gibly small p- value.) 

As mentioned previously, we do not expect to obtain 
robust results for groups with A*' < 10 and we will hence- 
forth discard these objects. Figure [5] shows the luminosity 
distributions in two subsamples constructed by restricting 
to A' > 10 and A' > 15 members. Equation ([SJ with param- 
eters from equation is an accurate description of each 
subsample. We show the KS p-values from a comparison of 
each subsample with this distribution. This suggests that 
these BOG group galaxies are in fact drawn from a universal 
luminosity distribution which is approximately independent 
of group richness and halo mass. 

It is not a priori obvious that this distribution should 
have been universal, nor is it obvious that this universal 
function should have simply been the all-galaxy luminosity 
function (truncated to only include objects that are brighter 
than Adr < —19.9). The latter is worrying, because it might 
arise as follows. Separating true group members from inter- 
lopers on the basis of angular position and redshift space 
information alone is known to be difficult. Using colors to 
help identify members of the same group helps significantly, 
but the BOG algorithm does not use color information. So 
it is conceivable that the algorithm returns an accurate es- 
timate of the number of members in a group, without ac- 
tually identifying the members themselves correctly. In this 
case, it is possible that objects identified in the catalog as 
group members are effectively random samples of the un- 
derlying galaxy distribution - with the choice of link-length 
parameters having been (too) strongly influenced by theo- 
retical expectations about what the distribution of A'' should 
be. If this is indeed why the all-galaxy luminosity function 
describes the BOG luminosities, then it would invalidate a 
number of published analyses based on this catalog - partic- 
ularly those which seek to constrain the phenomenon known 
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Figure 3. Luminosity distribution of the BCGs in subsamples 
of the Mr20 catalog with A^ > 10 and A^ > 15 (histograms). 
Sohd curves show the extreme value statistics for the brightest 
object in A'^ independent draws from the distribution Pgai(i) of 
equation JBJ with parameters from equation (O , averaged over the 
observed distribution of N in each subsamplo as in equation I I13I1 . 
We also display the corresponding p-values from a KS analysis 
comparing the data with the solid curves. For comparison, dashed 
curves show the corresponding predictions using the Blanton et 
al. parameters from equation ^. 
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Figure 4. Luminosity distribution of the satellites in subsam- 
ples of the Mr20 catalog with Af > 10 and Af > 15 (histograms). 
Dotted curves show the result of assuming that the satellite dis- 
tribution is given by subtracting from the full galaxy distribution 
the associated extreme value distribution, averaged over the ob- 
served distribution of N (the average of equation IA2I weighted 
by the number of satellites N — 1, over N). The KS p- values as- 
sociated with the dotted curves are also displayed. Solid curve in 
each panel is the full galaxy distribution pgai(L) of equation ^ 
with equation JT}. 



as assembly bias. Section[4]describes a novel test of this pos- 
sibility, which uses mark correlations. 

In the remainder of the paper, when discussing the Mr20 
catalog, we will show results obtained assuming universality 
of pgai(I/) using equation ((BJ with parameters from equa- 
tion (0. 

3.2 Central galaxies as statistical extremes 

We now ask if the BCG of a group with A^ members can 
be described as the brightest of A'' independent draws from 
Pgai{L). Since the number of groups in the Mr20 catalog 
with exactly A'^ galaxies where A'^ > 10 is small, we make 
this comparison by constructing subsamples restricted to a 
minimum value of A'' as in section 13.11 above. In this case, 
the corresponding theory prediction must be averaged over 
the allowed values of A''. We do this by constructing the 
distribution p{N) directly from the given subsample, and 
averaging equation ((2]) over this distribution: 



91 



(gal) 



{L)= E g['^'\L\N)p{N)- 



(13) 



allowed A^ 



FigureOshows the results for subsamples of Mr20 containing 
A'^ > 10 and A'' > 15 members. The solid red curves show 
the extreme value prediction (|13p using equation ((6]) with 
parameters from equation (O, together with the respective 
p-values from a KS analysis. We see a remarkable agreement 
between the predictions and the data. (Of course, p(A'^) is 
different for each subsample.) For comparison we also show 
the predictions when using the Blanton et al. parameters 
from equation ([H|. (In this case the KS analysis gives tiny 
p-values < 10~^.) 

3.3 Satellite luminosity distribution 

We now analyse the satellites in the Mr20 catalog, i.e. the 
data set obtained by subtracting the BCGs from the full 
catalog. The results of the previous section allow us to use 
the satellite luminosity distribution to perform a consistency 



check. If the full galaxy luminosity distribution pgai(L) is 
indeed universal, and if the BCGs obey the extreme value 
statistics of pgai(-L), then the satellite luminosity distribu- 
tion of the subsamples with A' > 10 and A'' > 15 must de- 
pend on the minimum number of members in a specific way: 
the predicted distribution is the average of equation (|A2|) . 
weighted by the number of satellites A'^— 1, over the observed 
distribution of A'^. The dotted curves show these predictions; 
they provide a good description of the data. The associated 
p-values from a KS test are also shown. 

The solid curves (same in each panel) show pgai(I/) from 
equation ^ with equation ((T)). The analysis in Appendix 1X1 
predicts that the satellite distributions at large A'^ should 
become approximately universal, asymptotically approach- 
ing the distribution Pgai(-L), and we see that this is indeed 
the case. 

3.4 Brightest satellites as second brightest objects 

The brightest satellites are, in a sense, more interesting than 
the BCGs, because there are two statistical contenders for 
describing their luminosity distribution. 

The first is motivated by the fact that BCGs appear 
to be consistent with extreme value statistics (equation [6] in 
equation U) . This suggests that the BSGs ought to be well 
described as the second brightest of N independent draws 
of the universal galaxy distribution pgai(I/) (equation|6]with 
equation [7] in equation |3|. Figure [S] shows that this is indeed 
the case. 

3.5 Brightest satellites as statistical extremes 

The alternative model is that BSGs are extremes of some 
universal satellite distribution, as suggested by the HOD 
model, and by the measurements which suggest that the 
overall satellite luminosity function is basically independent 
of group richness (Skibba et al. 2007; Hansen et al. 2009). 
Figure [6] shows this comparison for two choices of the uni- 
versal distribution Psa,t{L). More precisely, we compare the 
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Figure 5. Luminosity distribution of the brightest satellites in 
subsamples of the Mr20 catalog containing TV > 10 and TV > 15 
members (histograms). Sohd curves show the order statistics for 
the second brightest object in TV independent draws from the 
distribution Pgai(i) of equation l(6]l with equation ^, averaged 
over the observed distribution of TV, similarly to equation l|13p 
but using the extreme value distribution from equation Q. The 
corresponding KS p- values are also displayed. 
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Figure 7. Distribution of the luminosity gap AM for subsamples 
of the Mr20 catalog with TV > 10 and TV > 15 (histograms). 
The solid curves are the corresponding predictions of extreme 
value statistics equation I I17II . averaged over the corresponding 
observed distributions of TV and using Pgai(^) from equation ^ 
with equation 10 . For comparison, we also show the predictions 
based on parameters from equation ^. 
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Figure 6. Luminosity distributions of the brightest satellites in 
two subsamples of the Mr20 catalog: histograms are the same as 
in Figure \5\ Solid black curves show the extreme value statis- 
tics for the brightest object in TVsat independent draws from the 
distribution Pas,t{L). We used extreme value predictions of the 
corresponding dotted curves of Figure l4l (i.e.. (?isat(i|TV) from 
equation lA3l averaged over the observed TVsat values). Dashed red 
curves show the analogous extreme value distributions based on 
the HOD model of Zehavi et al. (2011) described in Appendix [Cl 



brightest satellite distributions for the two subsamples (the 
histograms are the same as in Figure [5]), with the extreme 
value distribution appropriate for the brightest of A'^sat inde- 
pendent draws of an assumed universal satellite distribution 
Psat(i), averaged over the observed distribution of A^sat val- 
ues (solid and dashed curves, see below). The resulting ex- 
treme value distribution is identical in form to equations (|13p 
and ([2]), with TV — ^ TVsat and Pgai(i) — s> Psat(i)- These pre- 
dictions depend on psat(i)- 

The solid curves show the result of using our analytic 
calculation of Psat(i) (dotted curves in Figure |4]) to provide 
a calculation of gia^t{L\N) (from equation IA3|) which we 
then average over the observed distribution of A^sat- The 
solid curves provide a good description of the measurements, 
with KS p-values comparable to those in Figure [S] this is 
not surprising since we are at large A'^, where we know that 
<?isat -^ ff2gai (see discussiou at the end of Appendix [X)) . 
Unfortunately then, this large TV dataset is not suitable for 
distinguishing between these two predictions. 

The dashed curves show the result of using the HOD 



result equation (|C7|I . We see that this tends to always pre- 
dict brighter distributions than the ones observed (the corre- 
sponding KS p- values are ~ 10~^). However, since the HOD 
prediction relies on parameter values derived from statisti- 
cal fits (Zehavi et al. 2011), we would caution against taking 
these results at face value. A more complete treatment would 
involve accounting for parameter errors in the HOD fit, but 
this is beyond the scope of this work. 



3.6 The luminosity gap 

If both the BCG and BSG luminosity functions are given by 
Order Statistics, then the joint distribution of their magni- 
tudes p{Mi,M2\N) (with Mj being the f^ brightest) is 

piMi,M2\N) = A^(A' - l)pgai(Mi) 

X Pgal(Af2)Pgal(> M2)'^-'e(M2 - Ml) , 

(14) 

with Q{x) the Heaviside distribution, and we have used 
the same notation pgaj as before to denote the uni- 
versal galaxy luminosity distribution, with pgai(m) = 
I/(m)pgai(I/(m)) ln(10)/2.5. This allows us to derive the dis- 
tribution of the luminosity gap, defined as the difference in 
magnitudes AM = M2 — Mi of the second brightest and 
brightest objects. Before considering this distribution, we 
note that another interesting distribution is that of the sec- 
ond brightest M2, for a fixed value of Mi. This is given by 



p{M2\Mi,N) = 



p{Mi,M2\N) 



p{Mi\N) 

= (TV-l)pgal(M2) 



Pgai(>M,^^-2 



-Pgal(> Ml 



-e(M2-Mi) 

(15) 



As Afi is made more negative, i.e. as the BCG is made more 
luminous, Pgai(> Mi) — >■ 1, and this distribution simply 
asymptotes to that of the brightest of TV — 1 draws from the 
universal galaxy distribution, independent of Mi. In other 
words, the typical gap AM increases as the BCG is made 
more luminous, illustrating what is known as the Bautz & 
Morgan (1970) effect. We can therefore understand the latter 
as a straightforward consequence of order statistics. 
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The distribution of the luminosity gap AM for fixed A^ 
is predicted to be 

p(AM|iV) = N{N - 1) /'dmpgai(m) 

xPgai(>m)^-"pg,i(m-AM). (16) 

As with the earher distributions, we can define averages of 
the fixed- Af gap distribution over subsamples, as 



p(AM) = Yl p{AM\N)p{N) . 



(17) 



Figure [7] compares the averaged prediction in equation (|17p 
with the subsamples of Mr20 having A^ > 10 and A'' > 15. 
We see a good visual agreement between the data and the 
prediction based on equation ^ with equation (O (solid 
curves), except at small AM, where the data show a number 
of groups with AAf precisely equal to zero. We have checked 
that all these systems correspond to fiber collided BCGs, 
and the algorithm of BOB then assigns the brightest satellite 
both the redshift and the absolute magnitude of the BCG. 
The KS p-values comparing the solid curves to the full data 
(i.e., including the cases with AM — 0) are negligibly small. 
However, comparing the same curves after subtracting the 
spike from the data leads to large p-values (> 0.1). In either 
case (with or without the spike), using equation ((6]) with 
equation ^ provides a better description of the large AAf 
tail than using equation ([S]). 



4 MARK CORRELATIONS AS A NOVEL 
TEST OF ORDER STATISTICS 

The previous section showed that the distribution of group- 
galaxy luminosities was very like that of all galaxies 
(whether or not they are in groups). This raised the question 
of whether or not the group catalog has correctly separated 
group members from the field. We will use mark correlations 
to address this and related questions. 

The mark correlation WW (r) / DD{r) is a statistic as- 
sociated with pairs of objects separated by r; DD{r) is the 
number of pairs of separation r, and IWF(r) is the result 
of weighting each galaxy i by some weight Wi/W, (where 
W is the average weight of the sample) when performing 
the pair count. In what follows, we will use the luminosity 
as a weight. If there is no correlation between a galaxy's 
luminosity and its position then WW/ DD should be unity; 
departures from unity indicate that the luminosities and po- 
sitions are correlated. 

We begin with the objects in the Mr20 group catalog. 
If the group-identification process ended up assigning ran- 
dom galaxies to groups, then the catalog which results from 
scrambling the list of luminosities among all the members 
of the original catalog should be stastically the same as the 
original one. Therefore, we measure the luminosity mark cor- 
relation WW/ DD in this scrambled catalog. We find that 
WW/DD ~ 1 as it should. Repeating this scrambling pro- 
cedure a few times and averaging the results gives some in- 
dication of the range of values around unity that one might 
expect given the observed set of L and positions, if the lumi- 
nosities and positions are indeed uncorrelated. The dotted 
curves in Figure [8] show this range, which we will use as a 
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universal ordered 
universal shuffled 




0.95 



s/h"'Mpc 

Figure 8. Luminosity mark correlation in the Mr20 catalog, re- 
stricted to galaxies which reside in groups with more than 10 
members (red symbols). The fact that this signal is different from 
unity on small scales (< l/i~^Mpc) indicates a significant cor- 
relation between luminosity and location within the group. The 
fact that it is also different from unity on larger scales shows that 
the luminosity function cannot be universal. This is more clearly 
demonstrated by the 'universal ordered' (magenta) and 'shuffled 
in group' (green) curves which separate these two effects (see 
text). The dotted curves ('universal shuffled') are a rough mea- 
sure of the statistical significance of the difference from unity. 



rough measure of the significance of a detection of a corre- 
lation between luminosity and position. 

Next, we scramble the luminosities as before - thus eras- 
ing any possible correlations - but then, within each group, 
we reassign the luminosities among the objects in the group 
so that they have the same rank-ordering as in the actual 
dataset. The resulting measurement of WW/DD shows the 
expected signal if the luminosity distribution were universal 
(i.e. the same for all groups), but the distribution within 
a group depends on location within the group (e.g., if the 
central galaxy is the brightest). The figure shows the result 
of averaging WW/DD over many such 'universal ordered' 
rescramblings (magenta curve). There is a clear tendency 
for WW/DD > 1 at r < l/i~^Mpc, indicating a stastically 
significant correlation between galaxy luminosity and group- 
centric position. Notice, however, that WW/DD declines to 
~ 1 on larger scales; this is a consequence of the fact that 
the typical group size is of order r < l/i~^Mpc, and we have 
erased all correlations on larger scales. (In the Halo Model 
description of mark correlations (Sheth 2005), the two-halo 
term for this case is unity.) 

Finally, the filled red symbols show WW/DD in the 
original (unscrambled) group catalog. On small scales, the 
signal is similar to that in the 'universal ordered' groups, but 
it clearly differs from unity even on larger scales. The fact 
that WW/DD 7^ 1 on > l/i"^Mpc scales suggests that the 
luminosity function itself is not universal. To separate out 
the effects of the group-centric trends from possible group- 
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group trends, we have performed a final scrambfing. We 
keep the hst of luminosities within a group the same, but, 
within each group, we scramble the luminosities amongst 
the group members. In this way, we remove that part of the 
signal from WW/ DD which is due to the correlation with 
group-centric position. Notice that now WW/DD ('shuffled 
in group', green curve) is like the original on large scales, 
but it does not increase as much on smaller (< l/i~^Mpc) 
scales. 

Thus, our mark correlation analysis has shown that i) 
group-centric position matters; and ii) even once this has 
been accounted for, the luminosity function is not universal 
for all groups. The first point derives from the fact that the 
'universal ordered' signal is not unity on small scales, and 
it is very similar to the original one, and the 'shufHed in 
group' signal differs from the original unscrambled one at 
these small scales. This strongly suggests that the brightest 
cluster galaxy does indeed lie closer to (if not at) the group 
center. And the fact that, on larger scales, the actual signal 
differs so significantly from the 'universal ordered' one (i.e. 
from unity) indicates that the luminosity function cannot be 
the same for all groups. This was not at all obvious from the 
(more traditional) analysis of the group and BCG luminosity 
functions in the previous section. 



5 CONCLUSIONS 

Brightest cluster galaxies are interesting for several reasons, 
the simplest being that they are the brightest and there- 
fore most easily observable of galaxies. As mentioned in the 
Introduction, it is also becoming increasingly certain that 
they are physically distinct in several other ways, e.g. in 
their morphology, velocity dispersions, etc., from the other 
galaxies (the satellites) which reside in groups and clusters. 
We argued that, from the point of view of Halo Model anal- 
yses, it is interesting to ask whether the luminosities of the 
BCGs retain any signature of their special nature. Our anal- 
ysis shows that, at the level of one-point distributions, the 
answer appears to be 'no': BCG luminosities (insofar as they 
are accurately represented in the Mr20 catalog that we anal- 
ysed) are consistent with being the statistical extremes of 
a universal luminosity function, which is well-described by 
equation ([6}. 

We also analysed the luminosity functions of the satel- 
lites in the Mr20 catalog. The behaviour of these distribu- 
tions for different values of group richness (i.e. number of 
members) N is especially interesting, since it potentially al- 
lows us to distinguish between two seemingly reasonable but 
incompatible predictions of Halo Model analyses. On the 
one hand, since the BCG luminosities are simply statisti- 
cal extremes of a universal galaxy luminosity distribution, 
it is reasonable to expect that (a) the satellite luminosity 
function is just that given by subtracting the BCGs from 
the universal one (equation I A2|) . and (b) the brightest satel- 
lite (BSG) luminosities are simply the second brightest of 
A'^ draws of the universal galaxy distribution. On the other 
hand, the HOD model predicts that (c) the satellite function 
is itself approximately universal, and (d) the BSGs are sta- 
tistical extremes of this function. We showed that these two 
sets of predictions are inequivalent in general. In the present 
case, however, the large values of N that we consider prevent 



us from distinguishing between these two predictions, each 
of which provides an acceptable description of the data. 

Additionally, we performed other tests to check the con- 
sistency of our results. We showed that the statistics of the 
luminosity gap (the difi'erence in absolute magnitudes of the 
BSG and BCG) in the Mr20 catalog are consistent with the 
predictions of order statistics (equation I16|) . In Appendix B 
we argued that statistical tests for the mean BCG luminos- 
ity based on fixed N are to be preferred over those based 
on, say, fixed total group luminosity Ltot, showing how the 
latter may lead to biased conclusions. We also emphasized 
that, in order to perform unbiased tests, it is crucial to use 
accurate descriptions of the bright tail of the galaxy lumi- 
nosity function. 

However, we showed that, at the level of two-point 
statistics, the BCG and BSG luminosity distributions are 
inconsistent with their having being drawn from a univer- 
sal luminosity function (i.e., one that is the same for all 
groups) . We argued that because the luminosity mark corre- 
lation function differs significantly from unity on large scales 
(Figure [8]) the luminosity function of groups must depend 
on group properties (e.g. mass). This was not at all obvious 
from the (more traditional) one-point analysis of the group 
and BCG luminosity functions themselves. 

In work in progress, we show that the extreme values 
assumption together with a universal luminosity distribu- 
tion Pgai(L) predicts a halo mass dependent BCG luminosity 
function g\{L\m) — X]jvP(-'^l'^)5i(-^l-^) if f he distribution 
of N depends on mass m. This would have immediate conse- 
quences for implementations of the Halo Model such as the 
Halo Occupation Distribution (HOD, Zehavi et al. 2005), 
since such analyses would considerably simplify upon impos- 
ing the extreme value restriction on the BCG luminosities. 
However, we have already seen that the assumption of uni- 
versality is inconsistent with the observed mark correlation 
signal on large scales. One might then wonder if the correct 
solution is to replace pgai(L) — > pgai(L|m). In this case, the 
BCG luminosity function averaged over N is mass depen- 
dent because both Pgai (and hence gi{L\N, m)) and p{N\m) 
depend on m. In this case it is no longer obvious that the 
extreme values hypothesis should be tested at fixed N as we 
argued earlier. Formulating the problem cleanly in this case 
is work in progress. 

Additionally, as discussed in the Introduction, this be- 
haviour of the BCG luminosities, in particular, the fact that 
their luminosity functions are narrower than those of all 
galaxies, potentially allows BCGs to be used for consistency 
checks of standard candle constraints on the luminosity- 
distance relation. As a final remark, we note that, while 
BCGs have brighter and narrower luminosity distributions 
than randomly picked galaxies, the distributions of the BSG 
luminosities are even narrower than those of the BCGs. 
From the point of view of using them as standard candles, an 
interesting trade-off then arises: while it is the BCG which 
is most easily observed out to large distances, the BSG is a 
more standard candle. It will be interesting to see to what 
extent these ideas can be implemented in upcoming cluster 
surveys. 
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that Psat(> LIA^) is 



P.at(> L\N) 



K 



>L 



•'^gliV 



•'^BCG ]V 



•'^satliV 



•'^glJV 






{NPU>L)-gi{>L\N)) 



N - 1 

^ iVPgal(<L)-Pgal«L)^ 



PM>L) 



N - 1 

Pgal(<L)-Pgal«L)^ 



N - 1 



which gives a satellite luminosity distribution 
Ps.t{L\N) ^ 



{l^l/NY 



Pgal(L) l-Pgal(<L)^ 



(Al) 



(A2) 



Notice that for large enough richness values A*', 1 — 1/A*' — >■ 
1 and Pgai(< L)^~^ -^ 0, for all interesting values of L, 
and hence pBs.t{L\N » 1) — !> Pgai(L). Figure |4] shows that 
averaging this distribution over the observed distribution of 
A'^ provides a good description of the distribution of satellite 
luminosities in the BOG group catalog, especially at large A'^. 
The extreme value statistics associated with this distri- 
bution yields 



N ■ 



(A3) 



which is clearly different from g2{< L\N) in the main text. 
Although they are different for general A*', it is easy to show 
that 



gis.ti< L\N) ^ g2{< L\N) as Af > 1; 



(A4) 



i.e., the distribution of the largest of A*' — 1 picks from the 
satellite distribution does indeed tend to that of the second 
largest of N picks from the full distribution. We check this 
explicitly in Figure [S] of the main text. 



APPENDIX A: PREDICTION FOR THE 
SATELLITE LUMINOSITY DISTRIBUTION 
FROM EXTREME VALUE STATISTICS 

If the luminosity distribution of galaxies pgai(I/) is univer- 
sal, and if the BCGs in groups containing A'^ members are 
described as the brightest objects of N independent draws 
from this distribution, then the distribution of the satellites 
in such groups must have a specific dependence on A'^, as we 
show below. 

Consider a sample of A/"^^"'" galaxies brighter than 

Lmin, residing in A/'j^^'"'° groups (halos) which contain ex- 



NAf, 



>L„ 



Since 



actly N members each, so that A/''j'jy" — j'^vj ,^ 

Pgai(L) is universal, the number of these galaxies brighter 

than L, is JV^,^ = Af \f/°'"Pgai{> L). Similarly, the number 



of BCGs brighter than L is M^qqij^ 



■^h^^"""5i(> L\N), 
where (?i(> L\N) = 1 — Pgai(< L)^ follows from equa- 
tion (O. 

The satellites are obtained by subtracting the BCGs 
from the full sample, and the number of satellites brighter 



than L is therefore N^J^,j^ = -^^in ' 



•^BCGIiV' which tells us 



APPENDIX B: A SHUFFLING-BASED TEST 
OF EXTREME VALUE STATISTICS 

In the main text, we argued that equation (O, with pa- 
rameters from Bernardi et al. (2010) provided a good fit to 
the distribution of luminosities in the B06 group catalog. 
Here, we provide additional tests of this assumption. We 
also make two points: First, when one is interested in de- 
riving constraints from the extreme tails of a distribution, 
then it is important to have an accurate description of the 
tails. Second, when testing extreme value or, more generally, 
order statistics, the test is best done keeping A'^ fixed. 

A simple test of whether or not BCG luminosities are 
unusual compared to those of the other galaxies in the group 
is to make a mock catalog by scrambling the list of galaxy 
luminosities among the different groups. The BCG luminos- 
ity distribution in the mock catalog represents the expected 
distribution if BCGs were not otherwise special, and com- 
parison with the BCGs in the original dataset indicates if 
they are indeed special. In practice, we generate 500 realiza- 
tions of the scrambled mock catalog, so as to produce a less 
noisy estimate of the expected extreme value distribution. 

The B06 groups are characterized by two numbers: the 
total number of galaxies, A'^, and the total luminosity of the 
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Figure Bl. Mean value of Lbcg ^.s a function of group size N (left) and Ltot (right) in the B06 Mr20 catalog (symbols). (Left panel): 
Dottcd-dashed line shows the mean relation between Lbcg S'ld ^ in 500 mock catalogs obtained by scrambling the observed list of 
galaxy luminosities and constructing groups such that the observed distribution of A'^ values is reproduced (see text). Solid red curve 
shows the mean prediction of extreme value theory based on equation ^ with equation (0, while red short-dashed curves indicate 
the corresponding region enclosing 95% of probability for each N. The data, mocks and theory predictions are in excellent agreement. 
Blue long-dashed and corresponding dotted curves show the mean and 95% region of extreme value theory based on the Schechter form 
of Blanton et al. (2003) equation (|8), and emphasize that using an inaccurate description of the bright tail of the galaxy luminosity 
function can lead to erroneous conclusions regarding, e.g., the statistical significance of outliers. (Right panel:) Solid line shows the mean 
relation between Lbcg ^^'^ ^tot in the same mocks which gave the dotted-dashed curve of the left panel. The data agree very well with 
these mocks, although they seem to lie systematically slightly higher than the curve at small Ltot. Dashed curve shows the same mean 
relation, but in mocks generated by fixing an observed Ltot value and randomly sampling the observed galaxy luminosity list until the 
total luminosity exceeds this Ltot (see text for more details). This procedure, in general, does not reproduce the observed distribution of 
N, and leads to biased conclusions regarding the mean relation. The dotted curves explicitly demonstrate this bias for the exponential 
luminosity distribution P(> L) = e"'^"^™'"" ^* by plotting ( LeCG l^tot ) for two choices of the distribution of richness p{N) (see text). 



group Ltot (simply the sum of the luminosities of A'' group 
members). When generating the mock catalog, we must de- 
cide whether to keep A'' or Ltot fixed. Order statistics are 
clearly a function of A'', so N is the natural variable. How- 
ever, one might be concerned that the luminosity of the BCG 
depends on 'hidden' variables, such as halo mass. If so, if one 
believes Ltot is a better indicator of this hidden variable, 
then one might wish to make mocks by holding it, rather 
than A'^, fixed. We will show results of doing both, but note 
that the results in the main text strongly suggest that the 
test at fixed A'^ is the prefered one. 

To generate a mock catalog, we first randomly scramble 
the observed list of galaxy luminosities (which we take from 
groups with A'^ > 10) . We then run over the observed values 
of A'', sequentially picking A'' luminosities from the scrambled 
list for each A'^, and store both the sum and the largest of 
these A'^ picks. We then move on to the next value of A'^. 
This procedure, by construction, reproduces the observed 
distribution of A'^ values. 

The dotted-dashed curve in the left hand panel of Fig- 
ure IB II shows the resulting correlation between A'^ and I/bcg 
in the mocks. This curve represents the extreme value pre- 
diction for the Lbcg , if BCGs are just the extremes of the 
group-galaxy luminosity function. The symbols show the ac- 
tual correlation in the Mr20 catalog (wherever possible, we 



average over the BCG luminosities at fixed A'^). We have 
deliberately chosen to show the log of Lbcg because the ex- 
treme value distribution has a long tail to large L - showing 
the log brings the visual impression of the scatter around 
the mean value closer to the correct one. 

The solid curve shows the associated prediction for this 
correlation if we insert equation (|6]) in equation ([2} . It lies 
very close to the dotted-dashed curve. This is consistent with 
one of the points made in the main text: that the all-galaxy 
luminosity function, equation ((Sjl with equation (O, provides 
an excellent description of the Mr20 group-galaxy luminosi- 
ties. (The fact that it is systematically beneath the dot- 
dashed curve is consistent with the fact that the galaxies in 
this sample are slightly brighter than predicted by pgai(I/).) 
The short dashed curves on either side of it indicate the 
region which contains 95% of the probability (at each A'^), 
also calculated from equation ([2]). We note that none of the 
observed groups lie outside this region. 

The other dashed curve, and associated 95% regions, 
shows the extreme-values prediction if we use the Schechter 
function of Blanton et al. (2003), equation ((8)1, instead. This 
curve falls below that of our mock catalogs. Since a number 
of the Mr20 points lie outside its 95% band, had one used 
this curve, rather than performing the full shuffling proce- 
dure, one might have concluded that the BCGs in the Mr20 
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catalog are inconsistent with the extreme values hypothe- 
sis. However, this conclusion is unwarranted, because the 
Schechter function does not describe the bright-end of the 
group-galax;y distribution very well (see Figure[T]). Thus, the 
exercise above illustrates the importance of using accurate 
measures of the tail, if one intends to use the tail to draw 
important conclusions. 

The panel on the right shows the correlation between 
Lbcg and Ltot in the Mr20 catalog (symbols) and in the 
mock catalogs which were used to produce the panel on the 
left (solid curve). The agreement is very good, at least at 
large Ltot, again indicating that BCG luminosities are con- 
sistent with just being statistical extremes. We have also 
checked that the mean Ltot at fixed A'^ is close to being pro- 
portional to N for this catalog, with the best fit power law 
relation being Ltot = 1.66 x W^°LQh-'^N°-^'^. 

The dashed curve, which lies above the measurements 
at small Ltot but close to the solid curve at larger Ltot was 
obtained by generating mock catalogs following a slightly 
different procedure - one that is essentially the same as that 
recently used by Lin et al. (2010) in their analysis of BCG 
luminosities in another group catalog. In this case, for each 
observed Ltot, we continue to draw luminosities from the 
group galaxy distribution until the sum of these luminosi- 
ties first exceeds Ltot- We then compare this value and the 
one preceding it to the observed value, and assign the closer 
of the two to the mock group. We also determine the lumi- 
nosity of the mock BCG from the list of luminosities which 
contribute to the group's luminosity, before moving on to 
the next value of Ltot- To obtain smoother results, we run 
over the observed distribution of Ltot values 500 times. 

In this case, there is no guarantee that the mock cat- 
alog which results has the correct distribution of A'^. Since 
extreme value statistics are explicitly a function of the num- 
ber of draws A'' (e.g. equations [2] and |4]) , it seems unlikely 
that this procedure can yield a fair test of these statistics. 
Indeed, the difference between the dashed and solid curves 
in this panel is a measure of the bias in this particular test, 
since the solid line is associated with mock catalogs that, as 
shown in the panel on the left, are in excellent agreement 
with the expected extreme value distribution. 

This is easier to appreciate with a simpler example. 
When the universal function p(L) is an exponential, P(> 



L) 



-(L-L„ 



'"" ^* , then the mean BCG luminosity at fixed 



Ltot and A'' takes the form 



( Lbcg |Ltot ,N) — Lmii 
(defined for Ltot > A^Ln 



+ — (i. 



where Ha 



NLtnin) (Bl) 

Er=i(iA) 

is a harmonic number. The mean BCG luminosity is 
then (LBCclLtot) = Eiv(-'^BCG|Ltot, A^ )p(A/'|Ltot). The 
distribution p(A'|Ltot) is constructed as p(A''|Ltot) = 
p{Ltot\N)p(N)/ Eiv' piLtot\N')p{N'), where 



L.p(LtotlA') 



3(itot-JV_L„i„)/L. 



NL^ir 



L. 



which is also defined for Ltot — NL^nin > and is zero oth- 
erwise. 

One therefore needs to choose a "prior" distribu- 
tion p{N), and the dotted curves in right panel show 
(Lbcg I Ltot) for the exponential luminosity distribution 
when p(A'^) is the one observed in the data (lower, red) and 



in the mocks described above which kept Ltot fixed (upper, 
black). We see that the difference between the curves has 
the same sense as that for the mocks based on the actual lu- 
minosity distribution. This explicitly demonstrates the bias 
introduced by using the wrong distribution of A'^. 

We conclude that the BCG luminosities in the Mr20 
group catalog are consistent with being the statistical ex- 
tremes of the group-galaxy luminosity function - the latter 
being very well described by the all-galaxy luminosity func- 
tion. 



APPENDIX C: THE HOD MODEL OF ZEHAVI 

et al. (2011) 

The HOD implementation of Zehavi et al. (2011) 
parametrizes the mean number of galaxies { A'^l > L,m) 
brighter than some threshold L in a halo of mass m using 

{N\ >L,m)=/oen(>L,m)(l + Afsat(>L,m)) , (CI) 

where 

r r^ r N 1 A , „ /logm-logAfmin(L) 

/con(> L, m) = - 1 + erf 

2 \ \ criogM(L) 



Afsat(> L,m) = 



Mo(L)\"'^' 



Mi{L) J 



(C2) 
(C3) 



The function /ccn(> L,ra) is interpreted as the number of 
centrals, and the product of /ccn(> L, m) and iVsat(> L, m) 
as the mean number of satellites in the m-halo. It is natural 
to interpret these functions in terms of conditional prob- 
abilities, with /cen(> LjTn) being the fraction of m-halos 
that host a central galaxy (note that it varies between and 
1 by definition) and Nsa.t{> L,m) being the mean number 
of satellites in halos that host a central. The distribution 
of the number of satellites brighter than L in an 77i-halo 
which hosts a central is assumed to be Poisson with mean 
A'sat(> L,m). 

The values of the parameters Mmin , criog m , Mo , Mi and 
a at various thresholds are then fit by comparing a Halo 
Model calculation of the projected 2-point correlation func- 
tion to measurements of luminosity dependent clustering 
in SDSS data. The results are in Table 3 of Zehavi et al. 
(2011), of which we use information for the 5 threshold val- 
ues brighter than —19.9. The HOD prediction for the satel- 
lite luminosity distribution is 



L'sat(> LIm,Lmin) 



AfBat(> L,m) 

iVsat(> Lniin,m) 



(C4) 



Since this depends on mass, obtaining predictions for the 
distribution at fixed A'sat requires an average over the mass. 
Following Skibba et al. (2007) we define the number density 
of groups with A^sat satellites 

71grp(A'sat|Lmin) = / dm——fccn{>Lmin,m)p{Nsa.t\'ni,Lmin); 

(C5) 
where dn/dm is the halo mass function and p(A'^sat|"i, Lmin) 
is a Poisson distribution with mean Afsat(> Lmim'Ti). For 
brevity we drop the reference to Lmin in what follows. 

The satellite luminosity distribution at fixed A'sat is 
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then given by 

P,at(>L|iVsat)= f 



dm 



dn fccn{> I/miii,m) 



dm 



,(iV., 



X p(Af,attm)P,at(> L\m) . (C6) 



This function is then used to compute an extreme value dis- 
tribution gisa.t{L\Ns!!Lt), which is identical in form to equa- 
tion (111), with iV -> iVsat and Pgal(i) -> Psat(i|A''sat) = 

—dPsat{> L\Ns!!,t)/dL. To be consistent, the prediction 
for A'aat > A'min luust be averaged using the distribution 
p(A'sat) = ngrp(A''aat)/ngrp wherc Mgrp = J dmdn/dmf ccni> 
Linin,m). In other words, 

E]V,,-,t>Ar„i„fflsat(i|A^Bat)ngrp(iVsat) 
gisatli) = ^^^ TJ-, T . (C7) 



E. 



^grp V^ ' s; 



Note that computing (?isat(i|A'sat) requires taking a deriva- 
tive of Psat(> I/|A'sat)- To cusurc smooth results, in practice 
we first fit simple monotonic forms to the values of the var- 
ious HOD parameters in Table 3 of Zehavi et al. For the 
mass function we use the analytical approximation of Sheth 
& Tormen (1999). We have checked that the luminosity de- 
pendent clustering predicted by our smooth fits to the Ze- 
havi et al. values is close to the measurements in their Figure 
10 and Table 8. We then use L^in = 0.747 x IO^^Lq/i"^ and 
A^min ~ 10, 15 to make our predictions. These are shown as 
the dashed red curves in Figure [6] 
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